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Abstract 

Generalized relational theories with null values in the sense of Reiter are first-order the- 
ories that provide a semantics for relational databases with incomplete information. In 
this paper we show that any such theory can be turned into an equivalent logic program, 
so that models of the theory can be generated using computational methods of answer 
set programming. As a step towards this goal, we develop a general method for calcu- 
lating stable models under the domain closure assumption but without the unique name 
assumption. 



1 Introduction 



We re-examine here some of the problems discussed in two important papers on the 



semantics of null values that were published many years ago. The first of them is 
Ray Reiter's paper "Towards a logical reconstruction of relational database theory" 
(jReiter 1984[) . Generalized relational theories with null values in the sense of Rcitcr 
are first-order theories that provide a semantics for relational databases with in- 
complete information. The incompleteness can be of two kinds. One is represented 



X 

by inclusive disjunction; for instance, the formula 



iSUPPLIES(iFoo, p x ) V iSUPPLIES(iFoo, p 3 ) (1) 

says: 1F00 supplies p\ or pa, maybe both. The other is represented by null values; 
by writing 

iSUPPLIER(lo), iSUPPLIES{uj, P3 ), (2) 

where a; is a null value, we express that some supplier, which may or may not 
already be in the database, supplies ^3. 

The second paper, by Bonnie Traylor and Michael Gelfond, is entitled "Repre- 
senting null values in logic programming" ( |Traylor and Gelf ond 1994|. The authors 
define, among other things, the "logic programming counterpart" of a generalized 
relational theory with null values — a logic program whose meaning under the answer 
set semantics is similar to the meaning of the theory under the standard semantics 
of first-order logic. 
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We propose here an alternative approach to turning Reiter's theories into logic 
programs, which represents the meaning of the theory more closely than the trans- 
lation from ( |Traylor "an d Gclfond 1994 1. We show also how these logic programs 



can be executed using computational methods of answer set programming (ASP) 
(Marek and Truszczyhski 1999 Nicmcla 1999; Lifschitz 2008|) — for instance, by run- 



ning the answer set solver CLINGO0 

The difference between null values and other object constants emphasized in 
Reiter's semantics is that null values are exempt from the unique name assump- 
tion: a null value may represent an object that has a name in the database, and 
two different null values may represent the same object. This fact leads us to the 
general problem of using answer set solvers for calculating the stable models that 
satisfy the domain closure assumption but may not satisfy the unique name as- 
sumption. Such models are allowed in some versions of the stable model semantics 
(|Ferraris et al. 2007) IFerraris et al. 20"TT|) . just as they are allowed in the defini- 
tion of circumscription ( [McCarthy 19 80 McCart hy 1986[ ). But existing answer set 
solvers do not deal with stable models of this kind directly. To take a simple exam- 
ple, the formula 

P(a)VP(b) (3) 

has minimal models of three kinds: in some of them, P(a) is true, and P(b) is 
false; in others, P(a) is false, and P(b) is true; finally, there are minimal models 
in which both P(a) and P(b) are true, along with the formula a = b. We will see 
how syntactic expressions describing these three possibilities can be generated by 
an answer set solver. Our method is applicable, in particular, to logic programs 
representing relational theories with null values. 

The word "generalized" in Reiter's terminology indicates the possibility of in- 
cluding disjunctive information, and in the rest of the paper it will be omitted. 



2 Relational Theories without Null Values 

2.1 Review of Reiter's Semantics of Relational Theories 

We begin with a signature that consists of finitely many object and predicate con- 
stants. A positive ground clause is a formula of the form Ai V •• -V A r (r > 1), where 
each Ai is a ground atomic formula whose predicate is distinct from the equality 
symbol. For instance, (JIJ is a positive ground clause. For any finite set A of positive 
ground clauses, the corresponding relational theory Ta is the set consisting of the 
following sentences: 

• the domain closure axiom iDCA: 

Vx W x = a 

a 

where the disjunction extends over all object constants a; 
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• the unique name axioms a ^ b for all pairs of distinct object constants a, 6; 

• the clauses A; 

• for each predicate constant P, the completion axiom 



Vx 



P(x)-> \/ x = 



a£ Wp 



(4) 



where x is a tuple of distinct object variables, and Wp is the set of all tuples a 
of object constants such that P(&) belongs to a clause from 

In view of the domain closure axiom iDCA and the unique name axioms, any 
model of Ta is isomorphic to a Herbrand modelH Consequently, in the discussion 
of models of Ta we can restrict attention to Herbrand models. 

Consider, for instance, Example 4.1 from (jReiter 1984[) . Its signature includes 
the object constants 

Pi, P2-, P3, lAcme, 1F00, 
the unary predicate constants 

iPART, iSUPPLIER, 
and the binary predicate constants 

iSUPPLIES, iSUBPART- 
The set A describes the following supplier and parts world: 

PART SUPPLIER SUPPLIES SUBPART 
pi lAcme (lAcme p\) (pi p 2 ) 

P2 1F00 (1F00 p 2 ) 

P3 

In other words, it includes the corresponding atomic formulas: 

iPART{ Pl ), iPART(p 2 ), iSUBPART( Pl ,p 2 ) ■ (5) 
In addition, A includes clause (P). 



2 The equality between two tuples of terms of the same length, such as x = a, stands for the 
conjunction of the equalities between the corresponding members of the tuples. We do not 
include equality axioms from (Reitcr 1984) because we assume here the ve rsion of the semantics 
of first-order formulas that treats equality as identity (see, for instance, ( Lifschitz ct al. 2008 
Section 1.2.2)). 

3 Recall that in the absence of function constants of arity > a Herbrand interpretation is an 
interpretation such that (i) its universe is the set of all object constants, and (ii) each object 
constant is interpreted as itself. A Herbrand interpretation can be identified with the set of all 
ground atomic formulas that are true in it and whose predicate is distinct from the equality 
symbol. 



4 



Vladimir Lifschitz, Karl Pichotta, and Fangkai Yang 



The completion axioms in this example are 



\/x(\PART{x) ->i = j)iVj; = P2Vi = p 3 ), 
Vx(iSUPPLIER(x) ->• x = lAcme Vi = 1F00), 
Vxy(iSUPPLIES(x, y) -> (x = lAcme A y = p\) 



V (x = \Foo A y = P2) 

V (x = \Foo A y = p\) 

V (x = 1F00 A y = p 3 )), 



\/xy(iSUBPART(x, y) -> (x = p 1 A y = p 2 ))- 



Theory T\ has 



3 Herbrand models: 



h 
h 
h 



I U {iSUPPLIES{iFoo, pi)}, 
I U {iSUPPLIES(iFoo, pa)}, 

I U {iSUPPLIES(iFoo, pi), iSUPPLIES(iFoo, p 3 )}, 



where / is the set of atomic formulas §5§ . 

Note that ^3 is not a minimal model of both I\ and I2 are proper subsets 
of I3. In the presence of disjunction, Rcitcr's completion axioms (j4]) guarantee only a 
weak form of minimality. A similar condition is used in the definition of the possible 
model semantics of disjunctive logic programs (jSak ama and Inoue 1994|) . 



For any set A of positive ground clauses, by ITa we denote the set of rules 



for all clauses A% V • • ■ V A r from A. Recall that this is an expression in the input 
language of CLiNGcE that allows us to decide arbitrarily whether or not to include 
the atomic formulas A\, . . . , A r in the answer set as long as at least one of them is 
included. 

The translation 1{^4} of a unit clause A is strongly equivalent ( Lifsc Stz et al. 20011 
ILifschitz et al. 2007[) to the fact A. Using this simplification we can say, for instance, 
that the logic program representing the example above consists of the facts ([5]) and 
the rule 



Furthermore, this program can be made more compact using the CLINGO conven- 
tions that allow us to use semicolons to merge a group of facts into one expression: 

part(pl;p2;p3) . 
supplier (acme ;f 00) . 
supplies (acme , pi ; ;foo,p2) . 
subpart (pi ,p2) . 

l{supplies (f 00 ,pl) , supplies (f 00 ,p3) } . 

4 Such expressions, "cardinality constraints," appeared originally as part of the input language 
of the grounder LPARSE (http://www.tcs.hut.fi/Software/smodels/lparse.ps.gz). 



2.2 Representing Relational Theories by Logic Programs 



l{Ax,...,A r } 



(6) 



1{iSUPPLIES(iFoo, px), iSUPPLIES(iFoo, p 3 )}- 
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Given this input, CLINGO returns 3 answer sets: 
Answer: 1 

part (pi) part(p2) part(p3) supplier (acme) 
supplier (foo) supplies (foo ,p2) supplies (acme , pi) 
subpart (pi ,p2) supplies (foo ,p3) 
Answer: 2 

part (pi) part(p2) part(p3) supplier (acme) 
supplier (foo) supplies (foo ,p2) supplies (acme , pi) 
subpart (pi ,p2) supplies (foo , pi) 
Answer: 3 

part (pi) part(p2) part(p3) supplier (acme) 
supplier (foo) supplies (f oo ,p2) supplies (acme ,pl) 
subpart (pi ,p2) supplies (foo , pi) supplies (foo ,p3) 

These answer sets are identical to the Herbrand models of the corresponding rela- 
tional theory. This is an instance of a general theorem that expresses the correctness 
of our translation: 

Theorem 1 

For any set A of positive ground clauses, a Herbrand interpretation / is a model 
of Ta iff I is an answer set of ITa • 

Proofs of theorems, including a combined proof of Theorems 1 and 2, can be 
found at the end of the paper. 



3 Null Values 

3.1 Review of Reiter's Semantics of Null Values 

We turn now to a more general framework. As before, the underlying signature is 
assumed to consist of finitely many object and predicate constants. We assume that 
the object constants are classified into two groups, the database constants and the 
null values. About a unique name axiom a ^ b we say that it is required if both a 
and b are database constants, and that it is optional otherwise. As before, A stands 
for a finite set of positive ground clauses. Let E be a set of optional unique name 
axioms. The relational theory with null values Ta,e is the set of sentences obtained 
from Ta by removing all optional unique name axioms that do not belong to E. In 
other words, Ta.s consists of 

• the domain closure axiom iDCA, 

• all required unique name axioms, 

• the optional unique name axioms from E, 

• the clauses A; 

• the completion axioms (0]). 

Consider, for instance, the modification of our example in which 

• the object constant cj is added to signature as the only null value, 
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• clause (Q} is replaced in A with clauses ([2]), and 

• S = W ^ Pl,^ P2, W ^ p 3 }- 

Thus u) is assumed to be a supplier that supplies part p 3 ] it may be identical to 
one of the suppliers lAcme, 1F00 or may be different from both of them, and it is 
certainly different from p\ 1 p2, p 3 . The completion axioms in this example are 

Vx(iPART(x) ->i = piVi=p2Vi = P3), 
\/x{\SUPPLIER(x) -4 x = lAcme Vi = 1F00 Vi=u), 
Wxy(iSUPPLIES{x, y) -> (x = \Acme Ay = pi) 

V (x — 1F00 A y = P2) 

V (x = uj A y = p 3 )), 
Mxy{\SUBPART(x, y) —± (x = pi Ay = p 2 ))- 

The set of unique name axioms of 2a, s includes neither uj ^ lAcme nor uj ^ 1F00. 
Accordingly, this theory has models of different kinds: some of them satisfy u> = lAcme; 
some satisfy uj — 1F00; in some models, both equalities are false. We will later return 
to this example to give a complete description of its models. 

3.2 Representing Theories with Null Values by Logic Programs 

In Section [2. 21 we saw how Reiter's semantics of disjunctive databases can be refor- 
mulated in terms of stable models. Our next goal is to do the same for databases 
with null values. 

Since the axiom set 2a, s may not include some of the optional unique name 
axioms, it may have models that are not isomorphic to any Herbrand model. For 
this reason, the problem of relating 2a, s to logic programs becomes easier if we 
start with a semantics of logic programs that is not restricted to Herbrand models. 

A version of the stable model semantics that covers non-Herbrand models is 
described in (jFerraris et al. 20111 Section 2)@ That paper deals with models of a 
first-order sentence and defines under what conditions such a model is considered 
stable relative to a subset p of the predicate constants of the underlying signature. 
The predicates from p are called "intensional." Unless stated otherwise, we will 
assume that p consists of all predicate constants of the underlying signature, so that 
every predicate constant (other than equality) is considered intensional. When this 
definition of a stable model is applied to a logic program, each rule of the program 
is viewed as shorthand for a first-order sentence, and the program is identified with 
the conjunction of these sentences. For instance, rule ((6]) can be viewed as shorthand 
for the formula 

r r 

f\ (Ai V ->Ai) A \J Ai- 

i=l i=l 

(The first conjunctive term says, "choose the truth value of each Ai arbitrarily" ; 
the second term requires that at least one of these atoms be made true.) 

5 Other possible approaches to the semantics o f logic programs that are not limited to Herbrand 
models use program completion ( Clark 1978 ) without Clark's equality axioms and the logic of 
nonmonotone inductive definitions (Dcncckcr and Tcrnovska 2008). 
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The paper referenced above defines a syntactic transformation SM p that turns a 
first-order sentence F into a conjunction 

FA- 

where the dots stand for a second-order sentence (the "stability condition"). The 
stable models of F are defined as arbitrary models (in the sense of second-order 
logic) of SM p [F]. 

From this perspective, Theorem [1] asserts that a Herbrand interpretation is a 
model of Ta iff it is a model of SM p [IIa], where p is the set of all predicate 
constants of the underlying signature. 

By IIa,£ we denote the conjunction of IIa with iDCA and with all unique name 
axioms from Ta,e (that is to say, with all unique name axioms except for the 
optional axioms that do not belong to S). The following theorem expresses the 
soundness of this translation: 

Theorem 2 

For any set A of positive ground clauses and any set E of optional unique name 
axioms, Ta,s is equivalent to SM p [IIa,s], where p is the set of all predicate con- 
stants. 

In other words, an interpretation I is a model of Ta,e iff I is a stable model 

of n A ,s- 

One useful property of the operator SM p is that 

SM P [F A G] is equivalent to SM p [F] A G 

whenever G does not contain intensional predicates (that is, predicate constants 
from p)H For instance, let 11^ s be the conjunction of IIa with the unique name 
axioms from Ta.s; then IIa,e is 11^ s AiDCA. Since iDCA does not contain inten- 
sional predicates (recall that all atomic parts of iDCA are equalities), SM p [IIa,s] is 
equivalent to SM P [II^ s ] A \DCA. The assertion of Theorem [2] can be reformulated 
as follows: an interpretation J is a model of Xa,s iff J is a stable model of 11^ E 
that satisfies iDCA. 

As we have seen, the translation IIa makes it possible to generate models of Ta 
using an answer set solver. Unfortunately, the translation IIa,e does not do the 
same for relational theories with null values. In the presence of null values we are 
interested in non-Hcrbrand models (for instance, in the models of the theory from 
the example above that satisfy uj = lAcme), but answer set solvers are designed 
to generate Herbrand stable models only. There is also a more basic question: a 
Herbrand interpretation can be viewed as a set of ground atomic formulas, but how 
will we describe non-Herbrand models by syntactic expressions? These questions 
are addressed in the next section. 



See l|Ferraris et al. 20111 Section 5.1). 
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4 Calculating General Stable Models 
4-1 Diagrams 

Consider a signature a consisting of finitely many object and predicate constants. 
By iHB a we denote the Herbrand base of a, that is, the set of its ground atomic for- 
mulas whose predicate is distinct from the equality symbol. By \EHB a ("extended" 
Herbrand base) we denote the set of all ground atomic formulas, including equali- 
ties between object constants. For any interpretation / of a satisfying the domain 
closure axiom (lDCA-interpretation, for short), by D(I) we will denote the set of 
the formulas from \EHB a that are true in /. This set will be called the diagram 
of/0 

If a subset X of \EHB a is the diagram of a lDCA-interpretation then it is clear 
that 

• the set of equalities in X is closed under reflexivity (it includes a — a for every 
object constant a), symmetry (includes b = a whenever it includes a = b), 
and transitivity (includes a — c whenever it includes a — b and b = c), and 

• X is closed under substitution: it includes P(b\, . . . , b n ) whenever it includes 
P(ai, . . . , On), % = b\,. . . , a n = b n . 

The converse holds also: 

Theorem 3 

If a subset X of \EHB a is closed under substitution, and the set of equalities in X 
is closed under reflexivity, symmetry, and transitivity, then there exists a iDCA- 
intcrprctation / such that D(I) = X . Furthermore, this interpretation is unique up 
to isomorphism. 

Since relational theories with null values include the domain closure assumption, 
Theorem [3] shows that their models can be completely characterized by diagrams. 
In the example above, the theory has 3 non-isomorphic models J\, J2, J3. The 
diagram of J\ consists of the formulas (0, ([2]), and a = a for all object constants a. 
The diagrams of the other two are given by the formulas 

h = J\ U {iSUPPLIES(iAcme,p 3 ),iSUPPLIES{u, Pl ),uj = iAcme,iAcme = oj}, 
J 3 = Ji U {iSUPPLIES(iFoo,p 3 ),iSUPPLIES(u,p 2 ),u = iFoo,iFoo = lu}- 

4-2 Reducing Stable DC A- Models to Herbrand Stable Models 

The problem that we are interested in can be now stated as follows: Given a first- 
order sentence F, we would like to construct a first-order sentence F' such that the 
diagrams of all 1DCA- interpretations satisfying SM p [F] can be easily extracted from 
the Herbrand interpretations satisfying SM P [F']. We say "can be easily extracted 
from" rather than "are identical to" because diagrams include equalities between 
object constants, and Herbrand models do not; occurrences of equality in F will 

7 This is essentially the "positive diagram" of /, as this term is used in model theory 
( Robinson 1963, Section 2.1), for the special case of lDCA-intcrprctations. 



Relational Theories with Null Values 



9 



have to be replaced in F' by another symbol. Our goal, in other words, is to define F' 
in such a way that diagrams of the stable lDCA-models of F will be nearly identical 
to Herbrand stable models of F 1 . 

The examples of F that we are specifically interested in are the formulas 11^ s , 
because stable iDC A- models of that sentence are identical to models of Ta.e- As 
a simpler example, consider formula ([3]). It has 3 minimal lDCA-models, with the 
diagrams 

K x = {P(a) 7 a = o,6= 6}, 

K 2 = {P(b), a = a, b = b}, (7) 
K 3 = {P(a),P(b), a = a,b = b,a= b,b = a}- 

Our translation F n> F' will allow us to construct these diagrams using ASP. 

The solution described below uses the binary predicate constant lEq, which is 
assumed not to belong to a. For any first-order formula F of the signature a, Fg 
stands for the formula of the signature erU {iEq} obtained from F by replacing each 
subformula of the form t\ — t% with iEq(t\, fe). (Here t±, are terms, that is, object 
constants or object variables.) The notation Xg q , where X is a set of formulas of 
the signature a, is understood in a similar way. By E a we denote the conjunction 
of the logically valid sentences 

^Jx(x = x)- 
Mxy(x = y -> y = x), 
\fxyz(x = yAy = z^x = z), 

and 

Vxy(P(x)Ax = y^P(y)) 

for all predicate constants P from a, where x, y are disjoint tuples of distinct 
variables. 

In the statement of the theorem below, F is an arbitrary sentence of the signa- 
ture <7, and p stands for the set of all predicate constants of a. 

Theorem 4 

For any lDCA-interprctation I of the signature a that satisfies SM P [F], the Her- 
brand interpretation D(I)^. q of the signature a U {iEq} satisfies 

SM p [(f A (8) 

Conversely, any Herbrand model of this formula is D(I) = ^, g for some lDCA-interpreta- 
tion I oi a satisfying SM p [F] . 

In other words, the transformation / i-)- D(I)^ maps the class of stable iDCA- 
models of F onto the set of Herbrand stable models of (F A E a ) = ^ q . The second part 
of Theorem [3] shows that this transformation is one-to-one up to isomorphism. 

By Theorem 2 from ([Ferraris et al. 201 ip . formula (J5|) is equivalent to 

SMp, B ,[(F A E a )% A Vxy(iEq{x, y) V -aEq(x, y))} ■ (9) 

The advantage of this reformulation is that it treats all predicate constants of the 
signature aU{iEq} as intensional. This is essential for our purposes, because existing 
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answer set solvers calculate Herbrand stable models under the assumption that all 
predicate constants occurring in the program (except for "predefined predicates") 
are intensional. 

For example, the diagrams (J7]) of the minimal DCA-models of <j3j> are identical, 
modulo replacing = with lEq, to the Herbrand stable models of the conjunction of 
the formulas ([3]), 

\/xiEq(x, x), 

Mxy{iEq{x,y) -+iEq(y,x)), (10) 
\/xyz(iEq(x, y) A \Eq{y, z) — > iEq(x, z)), 

Vxy(P(x) A iEq(x,y) P(y)), 

and 

\fxy{iEq(x, y) V -,iEq(x, y)) ■ (11) 

In logic programming syntax, this conjunction can be written as 

p(a) |p(b) . 
eq(X,X) . 

eq(X,Y) :- eq(Y,X) . 
eq(X,Z) :- eq(X,Y) , eq(Y,Z) . 
p(Y) :- p(X) , eq(X,Y) . 
{eq(X,Y)>. 

To make this program safell we need to specify that the only possible values of the 
variables X and Y are a and b. This can be accomplished by including the lines 

u(a;b) . 

#domain u(X) . #domain u(Y) . 
#hide u/1. 

(The auxiliary predicate symbol u describes the "universe" of the program.) Now 
the program can be grounded by GRINGO, and its Herbrand stable models can be 
generated by claspdH The output 

Answer: 1 

eq(b,b) eq(a,a) p(b) 
Answer: 2 

eq(b,b) eq(a,a) p(a) 
Answer: 3 

eq(b,b) eq(a,a) eq(b,a) eq(a,b) p(a) p(b) 

8 Safety is a syntactic condition required for "intelligent instantiation" — part of the process of 
generating answer sets. In the program above, the rules eq(X,X) and {eq(X,Y)} are unsafe. 

9 GRINGO and CLASpD are "relatives" of CLINGO; see Footnote (JD) for a reference. CLINGO 
itself cannot be used in this case because the program is disjunctive. CMODELS (http: 
//www. cs .utexas . edu/users/tag/cmodels .html) would do as well. Using the solver DLV 
(http : //www, dlvsyste m. com [ I will become an option too after eliminating choice rules in fa- 
vor of disjunctive rules with auxiliary predicates. We are grateful to Yuliya Lierler for helping 
us identify the software capable of executing this program. 
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is essentially identical to the list (J7J of minimal models, as could be expected on 
the basis of Theorem [4] 

The Python script nonH.py (for "non-Herbrand" ) is a preprocessor that turns 
a program F of a signature a without function symbols of arity > 0, written in the 
input language of GRINGO, into the program 

(F A E a )= Eq A Vxy(iEq(x, y) V -iEq(x, y)), 

written in the language of GRINGO also. Thus the Herbrand stable models of the 
output of nonH.py are the diagrams of the stable lDCA-models of the input (with 
equality replaced by iEq). As in the example above, a "universe" predicate is used 
to ensure that whenever the input of nonH.py is safe, the output is safe also. The 
diagrams of the minimal lDCA-models of formula can be generated by saving 
that formula, in the form 

p(a) |p(b) . 

in a file, say disjunction. lp, and then executing the command 
% nonH.py disjunction . lp I gringo I claspD 

(the CLASPD option instructs it to generate all answer sets, not one). The script 
can be downloaded from |http : / / www . cs . utexas . edu/ users? f kyang/nonH/. 

4-3 Calculating Models of a Relational Theory with Null Values 

The method applied above to the disjunction P(a) VP(b) can be applied also to the 
formula 11^ s . Stable lDCA-models of this formula can be generated using CLINGO 
with the preprocessor nonH.py. The preprocessor has two options that can be 
useful here. The command line 

°/o nonH.py <filename> -una <list of constants> 

instructs the preprocessor to conjoin its output with the unique name axioms a b 
for all pairs a, b of distinct object constants from the given list. The command line 

°L nonH.py <filename> -no-una <list of constants> 

adds the unique name axioms a =/= b for all pairs a, b of distinct object constants 
such that at least one of them does not occur in the given list. The diagrams of 
models of our relational theory with null values can be generated by saving the 
rules 

part(pl;p2;p3) . 

supplier (acme ;foo; omega) . 

supplies (acme , pi ; ;foo,p2; ; omega, p3) . 

subpart (pi ,p2) . 

:- omega==pl. 

: - omega==p2 . 

: - omega==p3 . 
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in a file, say db . lp, and then executing the command 
7o nonH.py db.lp -no-una omega I clingo 
The following output will be produced: 
Answer: 1 

part(pl) part(p3) part(p2) supplier (acme) supplier (omega) supplier (foo) 
supplies (omega, p3) supplies(foo,p2) supplies (acme , pi) subpart (pi ,p2) 
eq(omega, omega) eq(foo,foo) eq(acme , acme) eq(p3,p3) eq(p2,p2) eq(pl,pl) 
eq(omega, f oo) eq(foo , omega) supplies (omega, p2) supplies(foo,p3) 
Answer: 2 

part(pl) part(p3) part(p2) supplier (acme) supplier (omega) supplier (foo) 
supplies (omega, p3) supplies(foo,p2) supplies (acme , pi) subpart (pi ,p2) 
eq(omega, omega) eq(foo,foo) eq(acme , acme) eq(p3,p3) eq(p2,p2) eq(pl,pl) 
Answer: 3 

part(pl) part(p3) part(p2) supplier (acme) supplier (omega) supplier (foo) 
supplies (omega, p3) supplies(foo,p2) supplies (acme , pi) subpart (pi ,p2) 
eq(omega, omega) eq(foo,foo) eq(acme , acme) eq(p3,p3) eq(p2,p2) eq(pl,pl) 
eq(omega, acme) eq ( acme , omega) supplies (acme ,p3) supplies(omega,pl) 

It is essentially identical to the set of diagrams J\ , Ji , J3 . 

4-4 Comparison with the Trayloi Gelfond Translation 

The approach to encoding relational theories with null values by logic programs 
proposed in ( |Traylor and Gelfond 1994| ) does not have the property established 
for IIa,e in Theorem [2j generally, there is no 1-1 correspondence between the 
models of Ta.s and the answer sets of the Traylor — Gelfond translation. For in- 
stance, the logic programming counterpart of our main example in the sense of 
flTraylor and Gelfond 1994] ) has 2 answer sets, not 3. It uses strong (classical) nega- 
tion ([Gelfond and Lifschitz 199lj) . and its answer sets are incomplete sets of literals. 
One of them, for instance, includes iSUPPLIES(iFoo, p\) but does not include either 
of the two complementary literals iSUPPLIES(iFoo,p 3 ), -^SUPPLIES (1F00, p 3 ). 
This is how the program expresses the possibility of p 3 being supplied by 1F00, along 
with pi. The result of ( |Traylor and Gelfond 199 4) describes the relation of T"a,s to 
the intersection of the answer sets of its logic programming counterpart, not to the 
individual answer sets. 

Logic programming counterparts in the sense of ( [Traylor and Gelfond 1994| , like 
our programs IIa.s, can be turned into executable ASP code. The reason why 
that was not done in that paper is simply that the paper was written too early — 
the first answer set solver appeared on the scene two years after its publication 
([Niemela, and Simons 1 996). 

5 Proofs of Theorems 
5. 1 Proofs of Theorems [7] and []| 
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Lemma 1 

For any finite set A of positive ground clauses, formula SM p [iTa] is equivalent to 
the conjunction of the clauses A and the completion axioms (j4]). 



Proof 

Let C be the conjunction of the formulas 
P(a) V -.P(a) 



(12) 



for all atomic formulas P(a) occurring in A. It is clear that IIa is strongly equiv- 
alent0 to the conjunction of C with the formulas 



(13) 



for all clauses AiV- ■ -\/A r from A. According to Theorem 3 from ([Ferraris et al. 201 1[) . 
it follows that SM p [IIa] is equivalent to the conjunction of SM p [C] with for- 
mulas (TT3|) . Furthermore, (IT21) is strongly equivalent to -i-P(a) — > P(a). Con- 
sequently C is strongly equivalent to the conjunction of the formulas 



Vx 



\f (-.-. P(x) A x = a) P(x) 

.ag Wp 



for all predicate constants P. By Theorem 11 from ([Ferraris et al. 201 ip . it follows 
that SM p [C] is equivalent to 



P(x) <-> Y (nnP( X )AX = a) 



(14) 



Vx 



It remains to observe that (113[) is equivalent to Ai V • • • V i4 r , and that ()T4[) is 
equivalent to (HI). □ 



Theorem [U For any set A of positive ground clauses, a Herbrand interpretation I 
is a model of Ta iff I is an answer set of IIa- 



Proof 

A Herbrand interpretation is a model of Ta iff it satisfies the clauses A and the 
completion axioms ((4]). On the other hand, a Herbrand interpretation is an answer 
set of Ha iff it satisfies SM p [ILa]- Consequently the assertion of the theorem follows 
from Lemma [TJ □ 



Theorem [2j For any set A of positive ground clauses and any set S of optional 
unique name axioms, Xa,£ is equivalent to SM p [IIa,s]; where p is the set of all 
predicate constants. 
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Proof 

Recall that IIa,e is IIa A \DCA A U, where U is the conjunction of all unique 
name axioms from 7a. s- Since neither iDCA nor U contains intensional predi- 
cates, SM p [IIa,s] is equivalent to SM p [IIa] A iDCA A U. By Lemma [IJ it follows 
that SM p [IIa,e] is equivalent to the conjunction of the clauses A, the completion 
axioms (JH), and the formulas iDCA and U ; that is to say, it is equivalent to Ta,s- 
□ 

5.2 Proof of Theorem^ 

Theorem [3j If a subset X of \EHB a is closed under substitution, and the set of 

equalities in X is closed under reflexivity, symmetry, and transitivity, then there ex- 
ists a iDCA-interpretation I such that D{I) = X. Furthermore, this interpretation 
is unique up to isomorphism. 

Proof 

The binary relation 

a = b is in X (15) 

between object constants a, b is an equivalence relation on the set of object con- 
stants. For any predicate constant P, the n-ary relation 

P(ai,...,a n ) is in X (16) 

between object constants a±, . . . , a n can be extended to equivalence classes of (|15p . 
Consider the interpretation / such that 

• the universe of I is the set of equivalence classes of relation fj 15|) , 

• I interprets each object constant a as the equivalence class that contains a, 

• I interprets each predicate constant P as the extension of the corresponding 
relation (fl~6| to equivalence classes. 

Interpretation I satisfies iDCA, and D(I) = X. 

To prove the second claim, consider any iDCA-interpretation J such that D{ J) 
X . For any object constant a, let f(a) be the element of the universe of J that 
represents a. Function / can be extended to equivalence classes of relation (fT5|) . 
and this extension is an isomorphism between I and J. □ 

5.3 Proof of Theorem^ 

The proof of Theorem @] is based on the fact that a iDCA-interpretation I satisfies 
a first-order sentence F of the signature a iff the Herbrand interpretation D(iy§ q 
satisfies F^ q . This is easy to verify by induction on the size of F. What we need 
actually is a similar proposition for second-order sentences, because the formulas 
obtained by applying the operator SM p contain predicate variables. The straight- 
forward generalization to second-order sentences is invalid, however. For instance, 
let F be the formula 



3v(v(a) A ->«(&)) 



(17) 
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(v is a unary predicate variable). This formula is equivalent to o ^ b. If the universe 
of an interpretation / is a singleton then / does not satisfy F. On the other hand, 
the result of replacing = with lEq in F is F itself, because this formula does not 
contain equality. It is satisfied by every Herbrand interpretation, including D{I)^ q . 

To overcome this difficulty, we will define the transformation F h-> Fg for second- 
order sentences in such a way that it will involve, in addition to replacing = with lEq, 
restricting the second-order quantifiers in F . 

In this section, a second-order formula is a formula that may involve predicate 
variables, either free or existentially quantified, but not function variables. (An 
extension to universally quantified predicate variables is straightforward, but it is 
not needed for our purposes.) For any predicate variable v, iSub(v) stands for the 
formula 

V24 • • • x n yi ■ ■ ■ y n (v(x 1 , x n ) A \Eq(n, y\) A ••• A iEq(x n , y n ) -)• v(yi, y n )), 

where n is the arity of v. For any second-order formula F of the signature tr, F Eq 
stands for the second-order formula of the signature a U {iEq} obtained from F by 

• replacing each subformula of the form t\ — t% with iEq(t±, £2), and 

• restricting each second-order quantifier 3v to iSub(v). 

For instance, is F is (fT7|) then Fg is 

3v(iSub(v) A v(a) A ->v(b))- 

In application to first-order formulas, the notation Fg has the same meaning as 
before. 

Lemma 2 

A lDCA-interpretation / satisfies a second-order sentence F of the signature a iff 
the Herbrand interpretation D(I)^ satisfies Fjji . 

The proof of Lemma [2] is given in the online appendix. 

In the following lemma, as in the statement of Theorem |4j F is an arbitrary 
sentence of the signature a, and p stands for the set of all predicate constants of a. 

Lemma 3 

For any lDCA-interpretation / of the signature a, 

7hSM p [F] iff D{I)% hSM p [(FA^)i 9 ]. 

Proof 

Recall that SM p [.F] is defined as 

F A ^3v((v < p) A F*(v)) 
(jFerraris et al. 2011| Section 2.3), so that SM p [(F A E^ ] is 



FE q A {E a )= Eq A -.3v((v < p) A F*(v)^ q A E*{w)% q ) 



(18) 
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From the definitions of E a and of the transformation F i-> F*(v) ([Ferraris et al. 20111 
Section 2.3) we see that E*(v) is the conjunction of E a and the formulas 

Vxy(w(x) A x = y — > v(y)) 

for all members v of tuple v. Consequently E*(v) Eq is the conjunction of {E cr ) E : q 
and the formulas iSub(v) for all members v of tuple v. It follows that ([T5)l can be 
written as 

F5, A (^)S, A -dv |(v< P )Ar(v)|, A(^)i 8 A/\iSu&(«)J • 
This formula is equivalent to 

^ A -3v (^/\iSub(v) A ((v < p) A F*(v))i^ A (E a ) Eq , 
which can be written as 

SM p [J]i, A {E a )= Eq - 

The interpretation D(I) Eq satisfies the second conjunctive term. By LemmaO D(I) Eq 
satisfies the first conjunctive term iff I satisfies SM P [.F]. □ 

Theorem|4l For any iDCA-interpretation I of the signature a that satisfies SM P [.F], 
the Herbrand interpretation D(I) E l q of the signature a U {iEq} satisfies 

SM p [(F A E a ) Eq }- 

Conversely, any Herbrand model of this formula is D(I) E : q for some iDCA-interpreta- 
tion I of a satisfying SM p [F]. 

Proof 

The first assertion is identical to the only- if part of Lemma |3l To prove the second 
assertion, consider a Herbrand model J of SM p [(F A E a ) Eq \. Since this formula 
entails (E a ) = ^ q , J is a model of (E a )^ q as well. It follows that the subset X of 
\EDB a such that X Eq — J is closed under substitution, and the set of equalities 
in X is closed under reflexivity, symmetry, and transitivity. By Theorem [3l there 
exists a iDCA-interpretation / such that D(I) — X , so that D{I) E : q — J. By the if 
part of Lemma [3l I satisfies SM p [F] . □ 



6 Conclusion 

This paper contributes to the direction of research on the semantics of null values 
started in (jReiter 1984ft and ( |Traylor and Gclfond 1994]). More recently, null values 
were studied in the framework of the Datalog+/— project (|Gottlob et al. 20i"0|) . 

We have demonstrated a close relationship between Reiter's semantics of disjunc- 
tive databases and cardinality constraints in answer set programming. It shows also 
how answer set solvers can be used for computing models of relational theories with 
null values. 
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On the other hand, this paper improves our understanding of the role of non- 
Herbrand stable models. Are they merely a mathematical curiosity, or can they 
have serious applications to knowledge representation? We have provided arguments 
in favor of the usefulness of this generalization of the stable model semantics by 
showing, first, how non-Herbrand stable models can serve for representing null 
values, and second, how they can be generated using existing software systems. 

The generalization of the stable model semantics proposed in (jFerraris et al. 201 ip 
extends the original definition of a stable model in two ways: syntactically (it is ap- 
plicable to arbitrary first-order formulas) and semantically (a stable model can 
be non-Herbrand). The preprocessor f2lp (|Lee and P alla 2009) allows us to use 
existing answer set solvers for generating stable models of some syntactically com- 
plex formulas. On the other hand, the preprocessor nonH.py, described in this 
paper, allows us to use answer set solvers for generating some non-Herbrand sta- 
ble models those that satisfy the domain closure assumption but not the unique 
name assumption. The two programs can be used together. For instance, the stable 
lDCA-models of the formula 

(P(o) AP(b)) V (P(c) A 

(there are 23 of them) can be generated by running f2lp on the file 

(p(a) & p(b)) I (p(c) & p(d)) . 

and then running consecutively nonH.py, gringo, and CLASPD. 
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\ Lemma 2 

O ■ A lDCA-interpretation / satisfies a second-order sentence F of the signature a iff 

the Herbrand interpretation D{I) = ^ q satisfies . 

7— I 1 

■ Proof 

The proof is by induction on the size of F; size is understood as follows. About 
second-order sentences F and G we say that F is smaller than G if 

■ • F has fewer second-order quantifiers than G, or 

• F has the same number of second-order quantifiers as G, and the total number 

■ of first-order quantifiers and propositional connectives in F is less than in G. 

The induction hypothesis is that the assertion of the lemma holds for all sentences 
that are smaller than F. If F is atomic then 

X : I h F iff F e D(I) 



iff F^ q eD(I)^ q 
iff D{I)% q \= F%- 

If F is G A H then F^ q is G^ q A . Using the induction hypothesis, we calculate: 

I \= F iff I \= G and I \= H 

iff D(I)% q \= G^g and D(I)^ q \= H^ q 
iff D(J)I ? |=Ff 9 . 

For other propositional connectives the reasoning is similar. If F is WxG(x) then 
Fg is Vx (G(x)e Using the induction hypothesis and the fact that I satisfies 
iDCA, we calculate: 

I \= F iff for all object constants a, I |= G(a) 

iff for all object constants a, D{I)^ q \= G{a)^ q 
iff D{I)= Eq ^F^ q - 

For the first-order existential quantifier the reasoning is similar. 
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It remains to consider the case when F is 3vG(v), where v is a predicate variable. 
To simplify notation, we will assume that the arity of v is 1. For any set V of object 
constants, by iexp v we denote the lambda-expression!]] \x\J aeV (x = a). Since I is 
a lDCA-interpretation, / |= F iff 

for some V, I \= G(iexp v )- 

By the induction hypothesis, this is equivalent to the condition 

for some V, D{I)^ q \= H((iexp v )^ q ), (1) 

where H{v) stands for G(v)^, q . On the other hand, F^ q is 3v(iSub(v) AH(v)). The 
Herbrand interpretation D{I)^ q satisfies this formula iff 

for some V, D(I)g q \= iSub(iexp v ) and D{I)^ q \= H(iexp v ) ■ (2) 

We need to show that © is equivalent to ([1]). 
Consider first the part 

^(-O&j h iSub(iexp v ) (3) 
of condition ([2]), that is, 

-CK-Ob, \=Vxy(iexp v (x) A iEq(x,y) -nexp v (y))- 
It is equivalent to 

D (I)lsq \=yy{^x(iexp v {x) A iEq(x, y)) -> iexp v {y))- 

Interpretation D(I)^ q satisfies the inverse of this implication, because it satisfies 
~ixiEq(x, x). Consequently condition ([3]) can be equivalently rewritten as 

^(-Oig 1= Vy(3x(iexp v (x) A iEq(x, y)) O iexp v (y))- 

The left-hand side of this equivalence can be rewritten as Vaev iEq(a, y). It follows 
that condition © is equivalent to 

DWsg hVy (\/ aeV iEq(a,y) <+iexp v (y))- 

Furthermore, iEq(a, y) can be replaced here by iEq(y, a), because D(I)^, q satisfies 
Vxy(iEq(x, y) <-> iEq(y, x)). Hence (O is equivalent to 

D(I)E q \= ( lex Pv)Eq = iexpv' 

It follows that ([2]) is equivalent to the condition 

for some V, D(I)% q \= (iexp v )^ q = iexp v and D(I)% q \= H((iexp v )^ q ) ■ (4) 

It is clear that (j4j implies (J]). 

It remains to check that (JIJ implies ([JJ . Assume that 

D{I)= Eq ^H{{iexp v )%l (5) 

and let V' be the set of object constants a such that, for some b £ V, I \= a = b. 
We will show that V can be taken as V in (j4|). The argument uses two properties 
of the set V that are immediate from its definition: 

1 On the use of lambda-expressions in logical formulas, see (?, Section 3.1). 
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(a) V C V'; 

(b) if I \= a = b and a e V then b G V. 
Consider the first half of (|4|) with V" as 

^(^i? h (ieayv)I g = iexp K ,- 
This condition can be restated as follows: for every object constant a, 

^OO^hV^^M) iff ^)I,hV te r(^^ 

or, equivalent ly, 

The implication left-to-right follows from property (b) of V; the implication right- 
to- left is obvious (take b to be a). 

Consider now the second half of Q with V as V: 

D(ir Eq h h((i 

eX PV')Eq)' 

To derive it from ([S]), we only need to check that 

This claim is equivalent to 

/ |= iexp v , = iexp v (6) 
and can be restated as follows: for every object constant a, 

I\=\/ b eV'( a = b ) iff 'HV 6e y(«=&)- 

The implication left-to-right is immediate from the definition of V'; the implication 
righ-to-left is immediate from property (a). □ 



